Speech Rate and Prosody Units: Evidence of Interaction from Mandarin Chinese
نویسنده
چکیده
This paper discusses evidence of interaction found between speech rate and prosody units in Mandarin Chinese speech. Mandarin speech data of 2 different speech rates that had been previously labeled for perceived boundaries and prosody units were further analyzed for duration patterns at each prosodic level. Each prosody level demonstrated patterns of duration adjustment for both speech rates that could be accounted for by the model used. These patterns of duration adjustments are clearly systematic, suggesting how each prosody levels may interact and to an extent govern the temporal distribution of units within. Our findings demonstrate that though speech rate may appear to be a global phenomenon across speech flow on the surface, it in fact is very much an in integrated part of prosody organization constrained by each prosody level. To put simply, duration adjustment is being made systematically at each prosody level during speech production instead of just an across-the-board phenomenon. As a result, interactions between prosody units and temporal distribution are predictable. We believe these findings are a step forward in understanding temporal organization and distribution of speech flow as well as speech prosody in general, and should be directly applicable to predicting speech prosody of unlimited TTS in particular.
منابع مشابه
Recognizing Mandarin Chinese Fluent Speech Using Prosody Information—an Initial Investigation
The aim of the present paper is to demonstrate how prosody information could be used to recognize Mandarin Chinese fluent speech and what the recognized results imply. By applying our hierarchical prosody framework for fluent speech [1, 2] that specifies boundary breaks and boundary information across phrases and group phrases into speech paragraphs, we were able to develop software that automa...
متن کاملMandarin speech prosody: issues, pitfalls and directions
From the perspective of speech technology development for unlimited Mandarin Chinese TTS, two issues appear most impedimental: (1.) how to predict prosody from text, and (2.) how to achieve better naturalness for speech output. These impediments somewhat brought out the major pitfalls in related research, i.e., characteristics of Chinese connected speech and the overall rhythmic structure of sp...
متن کاملA Unit Selection-based Speech Synthesis Approach for Mandarin Chinese
The paper presents a unit selection-based speech synthesis approach for mandarin Chinese. Unit selection-based approach generates speech by selecting proper units from a speech corpus and connecting them together. In this approach, a set of features are defined to describe the speech units in the corpus and the expected units in the synthesized utterance. Based on the features, cost function is...
متن کاملCollecting Mandarin Speech Databases for Prosody Investigations
The prosody of Mandarin running speech is notably marked by grouping of short phrases into perceptually identifiable larger units in the speech flow. An organization of Mandarin speech prosody should not only account for the grouping phenomenon, but also offer some explanation for such grouping in relation to information of other linguistic levels as well as speech planning. The physical, phone...
متن کاملEvaluating prosody of Mandarin speech for language learning
This paper proposes an approach to automatically evaluate the prosody of Chinese Mandarin speech for language learning. In this approach, we grade the appropriateness of prosody of speech units according to a model speech corpus from a teacher’s voice. To this end, we build two models, which are the prosody model and the scoring model. The prosody model that is built from the teacher’s speech p...
متن کامل